Hugging Face's initiative to replicate DeepSeek-R1, focusing on developing datasets and sharing training pipelines for reasoning models.
The article introduces Hugging Face's Open-R1 project, a community-driven initiative to reconstruct and expand upon DeepSeek-R1, a cutting-edge reasoning language model. DeepSeek-R1, which emerged as a significant breakthrough, utilizes pure reinforcement learning to enhance a base model's reasoning capabilities without human supervision. However, DeepSeek did not release the datasets, training code, or detailed hyperparameters used to create the model, leaving key aspects of its development opaque.
The Open-R1 project aims to address these gaps by systematically replicating and improving upon DeepSeek-R1's methodology. The initiative involves three main steps:
1. **Replicating the Reasoning Dataset**: Creating a reasoning dataset by distilling knowledge from DeepSeek-R1 (see the sketch after this list).
2. **Reconstructing the Reinforcement Learning Pipeline**: Developing a pure RL pipeline, including large-scale datasets for math, reasoning, and coding.
3. **Demonstrating Multi-Stage Training**: Showing how to transition from a base model to supervised fine-tuning (SFT) and then to RL, providing a comprehensive training framework.
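To make the first step concrete, here is a minimal sketch of how a distilled reasoning dataset could be collected by querying a hosted DeepSeek-R1 endpoint. The endpoint, model identifier, prompts, and file layout are illustrative assumptions, not part of the Open-R1 codebase.

```python
# Hypothetical sketch: collect reasoning traces from DeepSeek-R1 to build an SFT dataset.
# The API base URL, model name, and prompts are placeholder assumptions.
import json
from openai import OpenAI

client = OpenAI(base_url="https://api.deepseek.com", api_key="YOUR_KEY")  # assumed endpoint

prompts = [
    "Prove that the sum of two even integers is even.",
    "A train travels 120 km in 1.5 hours. What is its average speed?",
]

records = []
for prompt in prompts:
    response = client.chat.completions.create(
        model="deepseek-reasoner",  # assumed model identifier
        messages=[{"role": "user", "content": prompt}],
    )
    # Store the prompt together with the model's reasoning trace and final answer.
    records.append({"prompt": prompt, "completion": response.choices[0].message.content})

with open("distilled_reasoning.jsonl", "w") as f:
    for rec in records:
        f.write(json.dumps(rec) + "\n")
```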
Alibaba's Qwen 2.5 LLM now supports context lengths of up to 1 million input tokens using Dual Chunk Attention. Two models have been released on Hugging Face, both requiring significant VRAM to use the full context window. The article also discusses deployment challenges with quantized GGUF versions and system resource constraints.
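As a rough illustration of what running one of these long-context checkpoints involves, the sketch below loads a 1M-token Qwen2.5 variant with transformers. The repository name and generation settings are assumptions, and filling the full 1M-token window needs far more VRAM (and a serving stack that implements Dual Chunk Attention) than this minimal snippet implies.

```python
# Minimal sketch: loading a long-context Qwen2.5 checkpoint with transformers.
# The repo id is an assumption; check Hugging Face for the exact 1M-token variants.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen2.5-7B-Instruct-1M"  # assumed repository name

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",  # spread the weights across available GPUs
)

messages = [{"role": "user", "content": "Summarize the attached report in three bullet points."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```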
smolagents is a simple library that enables agentic capabilities for language models, allowing them to interact with external tools and perform tasks based on real-world data.
Hugging Face's SmolAgents simplifies the creation of intelligent agents by allowing developers to build them with just a few lines of code using powerful pretrained models.
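A minimal agent, following the example pattern from the smolagents documentation (class names such as `HfApiModel` may differ between library versions), looks roughly like this:

```python
# Sketch of a minimal smolagents agent that can search the web to answer a question.
# Class names follow the smolagents docs at the time of writing; verify against your installed version.
from smolagents import CodeAgent, DuckDuckGoSearchTool, HfApiModel

# The model wrapper calls a model hosted on the Hugging Face Inference API.
agent = CodeAgent(tools=[DuckDuckGoSearchTool()], model=HfApiModel())

result = agent.run(
    "How many seconds would it take for a leopard at full speed to run the length of Pont des Arts?"
)
print(result)
```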
A detailed guide on creating a text classification model with Hugging Face's transformer models, including setup, training, and evaluation steps.
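The core of such a guide typically reduces to a few lines with the Trainer API. The sketch below uses an illustrative two-label setup on the IMDB dataset to show the usual setup, training, and evaluation flow; it is not the article's exact code.

```python
# Sketch of a typical text classification fine-tuning loop with transformers.
# Model, dataset, and hyperparameters are illustrative choices, not the article's exact setup.
import numpy as np
from datasets import load_dataset
from transformers import (
    AutoModelForSequenceClassification,
    AutoTokenizer,
    Trainer,
    TrainingArguments,
)

dataset = load_dataset("imdb")
tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased")

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True)

tokenized = dataset.map(tokenize, batched=True)

model = AutoModelForSequenceClassification.from_pretrained("distilbert-base-uncased", num_labels=2)

def compute_metrics(eval_pred):
    logits, labels = eval_pred
    return {"accuracy": (np.argmax(logits, axis=-1) == labels).mean()}

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="clf-out", num_train_epochs=1, per_device_train_batch_size=16),
    train_dataset=tokenized["train"].shuffle(seed=42).select(range(2000)),  # small subset for a quick run
    eval_dataset=tokenized["test"].shuffle(seed=42).select(range(500)),
    tokenizer=tokenizer,
    compute_metrics=compute_metrics,
)
trainer.train()
print(trainer.evaluate())
```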
HunyuanVideo is an open-source video generation model that showcases performance comparable to or superior to leading closed-source models. It includes features like a unified image and video generative architecture, a large language model text encoder, and a causal 3D VAE for spatial-temporal compression.
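If the model is used through diffusers (which added a HunyuanVideo pipeline after the release), generation looks roughly like the following. The repository id, resolution, and memory-saving calls are assumptions and should be checked against the current diffusers documentation.

```python
# Rough sketch of text-to-video generation with the HunyuanVideo pipeline in diffusers.
# Repo id, dtypes, and resolution are assumptions; requires a recent diffusers release and a large GPU.
import torch
from diffusers import HunyuanVideoPipeline, HunyuanVideoTransformer3DModel
from diffusers.utils import export_to_video

model_id = "hunyuanvideo-community/HunyuanVideo"  # assumed community-converted weights

transformer = HunyuanVideoTransformer3DModel.from_pretrained(
    model_id, subfolder="transformer", torch_dtype=torch.bfloat16
)
pipe = HunyuanVideoPipeline.from_pretrained(model_id, transformer=transformer, torch_dtype=torch.float16)
pipe.vae.enable_tiling()  # reduce VRAM pressure from the causal 3D VAE
pipe.to("cuda")

frames = pipe(
    prompt="A cat walks across wet grass at sunrise, realistic",
    height=320,
    width=512,
    num_frames=61,
    num_inference_steps=30,
).frames[0]
export_to_video(frames, "hunyuan_output.mp4", fps=15)
```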
Hugging Face launches Gradio 5, a major update to its popular open-source tool for creating machine learning applications, aimed at making AI development more accessible and secure for enterprises.
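At its core, building an app with Gradio remains a matter of wrapping a Python function; a minimal example (the function and labels are illustrative) looks like this:

```python
# Minimal Gradio app: wrap a Python function in a web UI with a few lines of code.
import gradio as gr

def greet(name: str, intensity: int) -> str:
    return "Hello, " + name + "!" * int(intensity)

demo = gr.Interface(
    fn=greet,
    inputs=["text", gr.Slider(1, 10, step=1, label="Intensity")],
    outputs="text",
    title="Greeting demo",
)

if __name__ == "__main__":
    demo.launch()  # serves the app locally; add share=True for a temporary public link
```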
WordLlama has been released on Hugging Face. The model is designed to give developers, researchers, and businesses a highly efficient and accessible tool for a range of NLP applications.
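Usage is intentionally minimal. Based on the project's README (the method names below should be verified against the installed version), a similarity and ranking call looks roughly like this:

```python
# Sketch of basic WordLlama usage for similarity and ranking, following the project README.
# Method names are as documented at the time of writing; verify against your installed version.
from wordllama import WordLlama

wl = WordLlama.load()  # loads the default lightweight model

# Semantic similarity between two short texts
score = wl.similarity("i went to the car", "i went to the vehicle")
print(f"similarity: {score:.3f}")

# Rank candidate documents against a query
query = "i went to the car"
candidates = ["i went to the park", "i went to the shop", "i went to the truck"]
print(wl.rank(query, candidates))
```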
NuExtract is a fine-tuned version of phi-3-mini for information extraction. It takes a JSON template describing the information to extract together with an input text. Tiny (0.5B) and large (7B) versions are also provided.
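A rough usage sketch with transformers follows. The prompt markers (`<|input|>`, `### Template:`, `### Text:`, `<|output|>`) reflect the format described on the model card as I recall it, so check the card before relying on them; the template and text are made up for illustration.

```python
# Sketch of calling NuExtract with a JSON template and an input text.
# The prompt format below is an assumption based on the model card; verify before use.
import json
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "numind/NuExtract"  # assumed base repo; tiny and large variants also exist
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16, trust_remote_code=True)
model.eval()

template = {"Company": {"Name": "", "Founded": ""}, "Product": ""}
text = "Acme Robotics, founded in 2019, announced its new warehouse robot, the Carrier X1."

prompt = (
    "<|input|>\n### Template:\n"
    + json.dumps(template, indent=4)
    + "\n### Text:\n"
    + text
    + "\n<|output|>\n"
)

inputs = tokenizer(prompt, return_tensors="pt")
with torch.no_grad():
    output = model.generate(**inputs, max_new_tokens=200)

# Decode only the newly generated tokens, i.e. the extracted JSON.
print(tokenizer.decode(output[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True))
```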
Hugging Face introduces a unified tool use API across multiple model families, making it easier to implement tool use in language models.
Hugging Face has extended chat templates to support tools, offering a unified approach to tool use with the following features (a code sketch follows the list):
- Defining tools: Tools can be defined using JSON schema or Python functions with clear names, accurate type hints, and complete docstrings.
- Adding tool calls to the chat: Tool calls are added as a field of assistant messages, including the tool type, name, and arguments.
- Adding tool responses to the chat: Tool responses are added as tool messages containing the tool name and content.
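Putting the three pieces together, an end-to-end sketch looks like this. The model choice and the `get_current_temperature` helper are illustrative; the message structure and `apply_chat_template` calls follow the unified tool-use API.

```python
# Sketch of the unified tool-use flow with chat templates in transformers.
# The model and the get_current_temperature helper are illustrative stand-ins.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

def get_current_temperature(location: str) -> float:
    """
    Get the current temperature at a location.

    Args:
        location: The location to get the temperature for, in the format "City, Country".
    """
    return 22.0  # stub value for the sketch

model_id = "NousResearch/Hermes-2-Pro-Llama-3-8B"  # any tool-use-capable chat model
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16, device_map="auto")

messages = [{"role": "user", "content": "What's the temperature in Paris right now?"}]

# 1. Pass Python functions as tools; the chat template converts them to JSON schema.
inputs = tokenizer.apply_chat_template(
    messages, tools=[get_current_temperature], add_generation_prompt=True, return_tensors="pt"
).to(model.device)
out = model.generate(inputs, max_new_tokens=128)
print(tokenizer.decode(out[0][inputs.shape[-1]:]))  # the model emits a tool call here

# 2. Record the requested tool call as a field of an assistant message.
tool_call = {"name": "get_current_temperature", "arguments": {"location": "Paris, France"}}
messages.append({"role": "assistant", "tool_calls": [{"type": "function", "function": tool_call}]})

# 3. Run the tool and append its result as a tool message, then generate the final answer.
messages.append({"role": "tool", "name": "get_current_temperature", "content": "22.0"})
inputs = tokenizer.apply_chat_template(
    messages, tools=[get_current_temperature], add_generation_prompt=True, return_tensors="pt"
).to(model.device)
out = model.generate(inputs, max_new_tokens=128)
print(tokenizer.decode(out[0][inputs.shape[-1]:]))
```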